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Abstract 

Imagine that there are two bins to which balls are added sequentially, and each 
| incoming ball joins a bin with probability proportional to the pth power of the number 

of balls already there. A general result says that if p > 1/2, there almost surely is some 
bin that will have more balls than the other at all large enough times, a property that 
we call eventual leadership. 

In this paper, we compute the asymptotics of the probability that bin 1 eventually leads 
when the total initial number of balls t is large and bin 1 has a fraction a < 1/2 of 
£f~) | the balls; in fact, this probability is exp(c p (a)i + O (t 2 ^ 3 )) for some smooth, strictly 

\Q . negative function c p . Moreover, we show that conditioned on this unlikely event, the 

fraction of balls in the first bin can be well-approximated by the solution to a certain 
ordinary differential equation. 



in 

^ ■ 1 Introduction 

Consider a discrete-time process in which there are two bins, to which balls are added one 
at a time. Each incoming ball chooses probabilistically which bin to go to according to the 
following rule: if bin 1 currently has n\ balls and bin 2 has n% balls, then the probability 
IJji ' that bin 1 is chosen is 

><: f_M 

/(m) + /(n 2 )' 

where / is a fixed positive function. These so-called balls-in-bins processes with feedback 
function / , which can be generalized to more than two bins (cf. Section El below) were 
introduced to the Discrete Mathematics community by Drinea, Frieze and Mitzenmacher 

*IBM T.J. Watson Research Center, Yorktown Heights, NY 10598. riolivei@us.ibm.com, 
rob.oliv@gmail.com. Done while the author was a Ph.D. student at New York University under the 
supervision of Joel Spencer. Work funded a CNPq doctoral scholarship. 

Courant Institute of Mathematical Sciences, New York University, New York 10012, NY. Email: 
spencer@cims . nyu . edu. 

1 The first author's thesis [5] contains a longer background discussion of this and related processes. 



1 



UJ. This family of processes was intended as a model for competition that is mathematically 
similar to some so-called preferential attachment models for large networks [51^01]. 

The authors of [5] were especially interested in the case f{x) = x p with p > a 
parameter. In this case, there is a tendency that the rich get richer: since / is increasing, 
the more balls a bin has, the more likely it is to receive the next ball. One of the main 
questions addressed in [BJ is whether this phenomenon results in effective preponderance by 
one of the bins in the long run. They proved that the answer is "yes" if p > 1 and "no" if 
p < 1. That is, if p > 1 then one of the two bins will obtain a 1 — o (1) fraction of all balls 
in the large-time limit, whereas if p < 1 the fractions of balls in the two bins both tend 
to 1/2. The case p = 1 is the well-known Polya Urn, in which case the limiting number 
of balls in bin 1 has a non-degenerate distribution depending on the initial conditions, so 
the result in seemingly completes the description of the family of processes given by the 
choices of p. 

However, stronger results are available. A paper by Khanin and Khanin [7] introduced 
what amounts to the same process as a model for neuron growth, and proved that if p > 1, 
there almost surely is some bin that gets all but finitely many balls, an event that we call 
monopoly. They also show that for 1/2 < p < 1, monopoly has probability 0, but there 
almost surely will be some bin which will lead the process from some finite time on (we call 
this eventual leadership), whereas this cannot happen if < p < 1/2. In fact, the result of 
[2j generalizes to any / with mm x£ ^ f(x) > 0, as shown e.g. in [TH IHj ITU]. 

Theorem 1 (From [3 Q31 HO [10J) //{/ m }+= 

is a balls-in-bins process and feedback func- 
tion f = f(x) > c for some c > 0, then there are three mutually exclusive possibilities, one 
of which happens almost surely irrespective of the initial conditions: 

1. if Yl n >i /( n ) _1 < +oO; of the bins receives all but finitely many balls (this is the 
monopolistic regime ); 

^- */ J2 n >i /( n ) 1 = +°° but J2 n >i f( n ) < +oo, monopoly does not happen but one 
of the bins has more balls than the other at all large enough times (this is the eventual 
leadership regime,); 

3. ifJ2 n >i f( n ) 2 = +°°; the balls alternate in leadership infinitely many times (this is 
the almost-balanced regime ). 

Notice that the three cases of the Theorem applied to the f(x) = x p family correspond to 
p > 1, 1/2 < p < 1 and < p < 1/2; in other words, this family of / has phase transitions 
at p = 1 and p = 1/2. 

The present paper is part of a series of works by the two authors and by Michael Mitzen- 
macher in which several more quantitative aspects of the three regimes are elucidated. We 
are especially concerned with the eventual leadership and monopoly regimes, where there 
are initially t balls in the system and bin 1 has a fraction a € (0, 1/2) of those balls. It 
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is easy to show that bin 1 has a positive probability of eventually leading the process, but 
this probability should get smaller and smaller as t increases. Thus we ask ourselves two 
simple questions: 

1. How fast does the probability that bin 1 will escape its unfavorable initial conditions 
and eventually lead converge to as t — ► +oo? 

2. What is the typical behavior of the process, given that bin 1 does escape? 

Our two main results apply to the case f(x) = x p , p > 1/2. We show that the answer to 
the first question is "exponentially small" and compute the exact rate of decay. Below, let 
[t, a] denote the pair ( \at] , t — \at\). 

Theorem 2 Assume that we have a balls-in-bins process with feedback function f(x) = x p , 
p > 1/2 (so that the strong eventual leadership condition holds), and let ELead be the event 
that the first bin eventually leads the process. Then, for all fixed a G (0, 1/2), the limit 

InPrr* „i (ELead) 
c p (a) = lim ^ 1 G R (1) 

exists, and is a smooth function of a satisfying c'(a) > on (0,1/2). Moreover, for any 
5 £ (0, 1/2) there exist C$ G R + and T$ G N such that 

Va G (5, 1/2 - 5),Vt > T s , e c ^ a) tVZ < Pr M (ELead) < e c ^ a) t+Cs . (2) 

The form of Theorem ^should be compared with that of Cramer's Theorem |15j . which 
estimates the exponential rate of decay of the probability of large deviations from the mean 
of sums of i.i.d. random variables. This analogy also applies to the proof of Theorem |21 
contains computations with Laplace transforms that resemble those used to prove Cramer's 
Theorem. In our case, however, the random variables we consider, although not i.i.d., are 
of a very specific kind. Theorem |^1 is proven in Section ^ below. 

Question 2. tunas out to have a more surprising answer than 1.. We will prove that 
conditioning on bin 1 escaping almost determines the behavior of the process, at least up to 
the time when bin 1 has half of the balls. To state this result precisely, define 

g p : (0,1/2) - 
a i ^ 

We will show below (cf. Remark that g p (a) > for all a G (0,1/2). This implies that 
the function A = A atP (-) solving the following ODE is increasing. 

/ = 9 P (A(s)), s>0 

\ A(0) = a [ ' 
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Such a solution is only guaranteed to exist for s G [0, T Pj0l ), where T P)Q , G R + U {+00}. 
Given that A Pt0l is increasing, T P)Q is finite if and only if lim s ^T p a A(s) = 1/2. In any case, 
there does exist some TfiaxiTfial Tp^ a as above, and A is uniquely defined function on 
[0,T Pia ). 

Our theorem can now be stated. 

Theorem 3 Consider a balls-in-bins process with feedback function fix) = x p , < p < 1/2 
started from initial conditions [t, a] . Let at(s) be the fraction of balls in bin 1 at time \s t\ , 
and let ELead be the event that bin 1 eventually leads. It then holds that for all K G R + 
satisfying K < T p ^ a , 

Pr M (VsG [0,K], \& t (s) - A p , a (s)\ < Wr l ' z | ELead) > 1 - e^'*) , t > 1. (5) 

Here, W is a constant depending on a and K , but not on t. 

Notice that the two possibilities presented above - T PiQ , < +00 or T p>a = +00 - are 
both a priori legitimate. In the former case, one would be able to show that the random 
function a{ ) conditioned on ELead converges weakly to A p ^ a in the space Z)[0,+oo) 0. 
In the latter case, for all e > 0, there would be a value of K = K e < T PjCr such that, 
with probability tending to 1 a(K e ) > 1/2 — e. It would be quite interesting to settle 
this matter: determining whether A(s) — > 1/2 as s converges to some finite T should only 
require a careful (but perhaps laborious) estimation of the RHS of g(a) for a near 1/2. The 
proof of Theorem |5] can be found in Section [5] below. 

Before we proceed, let us briefly discuss our proof techniques. This work employs 
the same fundamental tool as in the remaining papers in this series jSJ 111! HUj . as well 
as in other references |7| I14j (according to 0, the techique originated in Davis' work on 
reinforced random walks 4 ). We shall embed the discrete-time process we are interested 
in into a continuous-time process built from exponentially distributed random variables, so 
that inter-arrival times at different bins are independent and have an explicit distribution, 
which is very helpful in calculations. We call this the exponential embedding of the process. 
Our main conceptual contribution is to notice that the problems at hand lend themselves 
to proof via the exponential embedding method. 

The rest of the paper is organized as follows. After preliminaries are discussed in 
Section|5J Section describes the exponential embedding and its application to the eventual 
leadership event. The next two sections prove the main theorems, and Section |H1 discusses 
some open questions. 

2 Preliminaries 

General notation. Throughout the paper, N = {1, 2, 3, ... } is the set of non-negative inte- 
gers, R + = [0, +00) is the set of non-negative reals, and for any k G N\{0} [k] = {1, . . . , k}. 
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XA is the indicator function of a set (or event) A. 



Asymptotics. We use the standard O/o/Q/Q notation. The expressions "a ra ~ b n as 
n — ► tiq v and "o n <S b n as n — > n " mean that lim n _ >no (a n /& n ) = 1 and lim n _ >no (a n /6 n ) = 0, 
respectively. 

Balls-in-bins. Formally, a feedback function is a map / : N — ► (0, +oo) with positive 
minimum. A balls-in-bins process with feedback function / and B£N bins is a discrete- 
time Markov chain {(7i(m), . . . , Js(m))}^ with state space N B and transitions given 
as follows. For every time m > 1 there exists an index i m G [£?] such that I m (i m ) = 
Im-i(im) + 1 and 7 m (i) = 7 m _i(i) for i G [B]\{i m }. Moreover, the distribution of z m is 
given by 

Pr (i m = i I {Im'U) :0<m'<m,je [B]}) = ^J^7- - 

We will usually refer to the index i m £ [B] as the bin that receives a ball at time m. For 
any B, if E is an event of the process and u G N B , Pr u is the probability of E when 
the initial conditions are set to u. Finally, in the case B = 2, it will be convenient to use 
the notation [t,a>] (t G N, < a < 1 to denote the state ( \at] , t — \at] ) , i.e. there is a total 
of t balls in the bins, and the fraction of balls in bin 1 is (approximately) a. 

Exponential random variables. X = d exp(A) means that X is a random variable with 
exponential distribution with rate A > 0, meaning that X > and 

Pr(X >t) = e~ xt (t > 0). 

The shorthand exp(A) will also denote a generic random variable with that distribution. 
Some elementary but extremely useful properties of those random variables include 

1. Lack of memory. Let X = d exp(A) and Z > be independent from X. The distribu- 
tion of X — Z conditioned on X > Z is still equal to exp(A). 

2. Minimum property. Let {Xi = d exp(Ai)}^L 1 be independent. Then 

X min = min Xj = d exp(Ai + A 2 + . . . A m ) 

l<i<m 

and for all 1 < i < m 

Pr (AQ = X min ) = — (6) 
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3. Multiplication property. If X = d exp(A) and r) > is a fixed number, r\X = d exp(A/r/). 

4. Moments and transforms. If X = d exp(A), r G N and t G R, 

Ex[Xl = (7) 

Ex [ e «] = {t<X) (8) 

L J | +oo (t > A) W 

3 The exponential embedding 
3.1 Definition and key properties 

Let / : N — > (0, +oo) be a feedback function, B G N and (oi, . . . , ob) G N B . We define 
below a continuous-time process with state space (NU{+oo}) B and initial state (oi, ■ ■ ■ , ob) 
as follows. Consider a set {X(i, j) : i G [B], j G N} of independent random variables, with 
= d exp(/(j)) for all (i, j) G [B] x N, and define 

Ni(t) = sup| nG N : ^X(i,j) < 1 1 (t G [B], t G R + = [0, +oo)), (9) 

3=<H ) 

where by definition Ylj=i(- • • ) — if i > k. Thus Ni(0) = a% for each i G [B], and one 
could well have iVj(T) = +oo for some finite time T (indeed, that to/Z happen for our 
cases of interest); but in any case, the above defines a continuous-time stochastic process, 
and in fact the {Ni(-)}f =1 processes are independent. Each one of this processes is said to 
correspond to bin i, and each one of the times 

X(i, Oi),X(i, Oi) + X(i, cti + l),X(i, a,) + X(i, a, + 1) + X(i, <n + 2), . . . 

is said to be an arrival time at bin i. As in the balls-in-bins process, we imagine that each 
arrival correspond to a ball being placed in bin i. 

In fact, we claim that this process is related as follows to the balls-in-bins process with 
feedback function /, B bins and initial conditions (ai, . . . ,05). 

Theorem 4 (Proven in |3J 1141 Let the {-ZVj(0}ie[B] process be defined as above. 

One can order the arrival times of the B bins in increasing order ( up to their first accu- 
mulation point, if they do accumulate) so that T\ < T2 < ■ ■ . is the resulting sequence. The 
distribution of 

{I m = (JVi(T m ), N 2 (T m ), N B (T m ))} m&N 

is the same as that of a balls-in-bins process with feedback function f and initial conditions 
(ai,a 2 , • . . ,a B ). 
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One can prove this result 2 as follows. First, notice that the first arrival time T\ is 
the minimum of X(j,aj), (1 < j < B). By the minimum property presented above, 
the probability that bin i is the one at which the arrival happens is like the first arrival 
probability in the corresponding balls-in-bins process with feedback: 

Pr(Jf(i,Oj)= min X{j,aj) ) = — . (10) 

More generally, let t G R + and condition on (Ni(t))f =1 = (h)f =1 G N B , with bi > for 
each i (in which case the process has not blown up). This amounts to conditioning on 

bi-l bi 

Vi€[B] ^X(*A) <t< J2 X (iM 

j=a,i j=<n 

From the lack of memory property of exponentials, one can deduce that the first arrival 
after time t at a given bin i will happen at a exp(/(6j))-distributed time, independently for 
different bins. This almost takes us back to the situation of (jlOft . with bi replacing cij, and 
we can similarly deduce that bin i gets the next ball with the desired probability, 

f(bi) 



Ef=i/(6;)' 



3.2 On the eventual leadership event 

Before we move on to the main proofs, let us briefly discuss how the event ELead corre- 
sponding to eventual leadership by bin 1 can be expressed via the exponential embedding. 
We use the same notation and random variables introduced above, and in particular we 
use the embedded version of the balls-in-bins process defined above. However, we restrict 
ourselves to the B = 2 case with 

+oo 

]T/(jr 2 <+°c. (ii) 

Notice that this condition implies we are either in the monopolistic or in the eventual 
leadership regimes. Assume we start the process from state (x, y) G N 2 with x < y (i.e. 
bin 1 has less balls than bin 2). The event ELead is given by 

ELead = {3m > OVM > ml m (l) > I m (2)}. 



2 The exact attribution of this result is somewhat confusing. Ref. cites the work of Davis [1] on 
reinforced random walks, where it is in turn attributed to Rubin. 
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This can be restated as follows. For i G {1,2}, let Ur be the first m G N such that 

(i) 

Im{i) = r ) or set Ur = +00 if no such m exists. Then 

ELead = {3r > 0\/R > r U$ < U§ ] }. 

This carries over to the continuous-time process, in which the time it takes for bin 1 to 
reach level R is J2f=x ( 1 , J ) , and the analogous time for bin 2 is J2f=y X(2,j). It is easy 
to show that 



ELead 



R-l R-l 



3r>0Vfl>rJ]x(l,i) <^X(2,j) \ (12) 
j=& i=v ) 

R-l y-i 1 

3r>0yR>r^2(X(l,j)-X(2,j))-Y / X(2,j) <0\ . (13) 
j=y j=x ) 

The key now is to show that Ylf=y(X(l,j) — X(2,j)) converges as R — ► +00. Indeed, 
the random variables in the sum, 

X(i,j),i e {l,2},j>y 

are independent, and each term in the sum has zero mean (since X(l,j) = d X(2,j) = d 
exp(/(j))) and variances that add up to (cf. (JJJ)) 

R-l R-l „ +°° 9 

E Var - x &i)) = E 777)2 - E 77^2 < +°°( b y CO)- 

Kolmogorov's Three Series Theorem then implies that ^2,^= y {X(l, j) — X(2,j)) 6 R is a 
well-defined random variable, as stated. Moreover, the event in (|13|) holds if and only if 
Eg(I(l,i)-I(2,i))-Egl(2,i) < 0, except for a null event, because 
X(2,j)) and Y^Zx^-Qj) are independent (by the definition of the exponential embed- 
dings) and have no point masses in their distributions. It follows that 

(+00 y-1 \ 

Y^(X(l,j)-X(2,j))-^X(2,j) <0 . (14) 
J =2/ i=!B / 

This equation is fundamental to our proofs. 
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4 Escaping a very likely defeat 



In this section we present the proof of Theorem [21 For convenience, we have divided our 
argument into four parts. In Section 14,11 we outline our proof method, which consists 
of careful estimates of Laplace transforms. Such estimates are carried out in Section 14.21 
and Section 14.31 Those results are collected and applied to the proof of the Theorem in 
Section IPl 



4.1 Our method of proof 

As usual, our proof begins by writing down the event under consideration in terms of the 
exponential embedding random variables, using in this case Q14|) with (x,y) = [t,a\. 

(t-\ce£\-l +oo \ 

Y Y (X(l,j)-X(2,j)) <0 \ . (15) 

j=\at\ j=t-[at] ) 

Hence, if we define the following independent random variables 

t-\at\-X 

A t = y ( 16 ) 

j=\at\ 



hOO 



A t = Yl (X(hj)-X(2,j)), (17) 

j=t-\at\ 

and let Z t = A t + A t , we deduce that 

Pr M (ELead) = Pr (Z t < 0) (18) 

and that, for all A > 0, 

Pr M (ELead) < Ex [exp(-XZ t )] = Ex [exp(-AA t )] Ex [exp(-AA t )] . (19) 

Thus the "standard trick" of employing the Laplace transform provides an upper bound 
on ELead. We now use a less standard trick for lower bounding this probability in terms of 
the same Laplace Transform. Our approach is essentially that of Spencer |13j . 

Let A > and rji > 772 > be given. Then 

Pr(Z t < 0) > Pr(-r ?1 < Z t < - V2 ) > e"^ 1 Ex [e- XZt X{- m <z t <- m }] , (20) 

since —771 < Z t < 772 implies that Ar/i > —XZ t . Now let e > be fixed. Then, if Z t > —772, 
then — XZt < — (1 — e)XZ t — r/2£, and if Z t < — r/i, then — XZ t < — A(l + e)Z t — eAr/i. Thus 

e -\Z t < e -(l+e)XZ t -Xe m + g - (l-e)A Z t -tXm 

on the complement of {—7/1 < Z t < —^2}- (21) 
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Hence 
Ex 



\ 2 

e Xz t e(-f7i,-»72) 



> Ex 



-Aerji 



Ex 



-(l+e)AZ t 



Ex 



-(l-e)AZ t -eAT) 



whenever the Laplace transforms above are finite. Plugging this last inequality back into 
1)20(1 yields the following general lower bound. 



Pr (Z t < 0) 

> e^Ex 



-xz t 



e -Aer?i Ex ^-(l+e)AZ t j + e -Aer/2 Ex ^-(l-e)AZi 

1 Ex [e~ XZt ] 



(22) 



How can one use the upper and lower bounds above? For the sake of understanding what 
follows, let us indicate how inequalities (|19|) and (|2"2"|) are typically employed. Assume that 
there is a choice of A* = A£ that minimizes or nearly minimizes the expression 



h t (A) = ^lnEx 



(23) 



Then one could hope that h' t (X*) ~ 0, h'l{\*) > 0, and that there would exist a constant a 
not depending on t such that for all 5 > small enough 



h t ((l±5)X*) < h t {X) + a5 2 . 



(24) 



Thus our main expectation is that ht has an minimizer A* and that it behaves like a "nice" 
strictly convex function around A* in a way that does not depend on t. Now assume that 
e is small enough (but fixed) and we set r\\ = y/et/X, r\2 = y/et/X in (|2l)|). then 



e - A "i £ Ex [ e -(i+«0A* 2t" 
Ex [e" A *^] : 



= exp{[^((l + e)A*)-fc t (A*)-e]i} 
< exp{(ae 2 -e 3 / 2 )t} = e-^). 



and similarly 



e - A " 2€ Ex [e- 



-(l-e)A* Z t ] 



-n(t) 



Ex [e- A * z '] 

Thus in this case, (120(1 and (with the choice of A = A*) would imply that 

(1 - e -myh t {\*)t-^et < p r ^ < Q ) < e ft t (A*)t_ 



This last expression would imply that 

lnPr(Zt < 0) 



h t (X*) 



<e-o(l) < 



<h t (X*) + o(l) for t » 1, 



(25) 
(26) 

(27) 
(28) 
(29) 
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for all small enough e, which shows that 

Urn '" Pr(Z ' <0) -MA-) = 0. (30) 

t^+oo t 

The above exposition does not exactly correspond to our proof of Theorem [2 However, 
the spirit of the two proofs is the same. That is, we will show that the logarithms of our 
Laplace transforms are "strictly convex in the limit", and use that to prove the desired 
result. 



4.2 Analysis of the Laplace Transform 

To apply the above method, we need to analyze the Laplace transform of Zf = + Af. 
We start with Ex [exp(—XA t )]. 

t-M-i j 

Ex[exp(-A^)] = [] — x (31) 

j=\at\ i + 

= -p \ J: In 1 (32) 

With foresight, we parameterize A = X(p) = p(l — a) p t p , for some p > 0, and deduce that 

Ex [exp(-XA t )] = (33) 
't-\a£\-l 



exp 



j=\at\ 



1 



1 + 



(l-a)Pp 



exp < i x 



t-[otl-l 

7 E >" 



1 



i + 



(l-q)Pp 



(34) 



(35) 



It is easy to see that the bracketed term is (close to) a Riemmann sum. In fact, the function 
u i— > ln(l/(l + (1 — a) p p/u p )) is monotone increasing, so for any \at\ < j < t — \at], 



< 



3 + 1 



In 



1 



1 + 



(l-a)Pp 

vP 



dv In 

t 



1 



1 + 



(l-q)Pp 

U/t) p 



(1-a) 



2±1 
(1-a)* 



J 

(l-a)t 



In 



1 



1 + 



(l-a)Pp 



In 



, (l-a)Pp 

1 + 



In 



1 



1 + 



(l-a)Pp 



• (36) 
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Summing over j then yields 

t-\at\ 



, (l-a)t 

< (1 - a) / In 



1 

< - 
~ t 



In 



1 + 



((t-\at\)/t)P 
(1) 



07*)* 

In 



1 + 



(l-a)Pp 



• (37) 



Thus we deduce that for all 5 > there exist C = CTy > 0, Tg G N such that if a > 6 and 
t > T, (1) , then 



8 ' 

(i) i *-r«*i-i / i \ r 1 ( i 

^ £ M TTo=^ - (1 " a) 7 - ln (lT^ 



a 



< (38) 



It follows that, for all a > 5 and i > 

In Ex [exp(-A(p)A t )] 



c 



(1) 



(39) 



We now consider the Laplace transform of A t , with the same parametrization A = A(/o) as 
above. 



Ex 



+oo ^ 

[exp(-AA t )] = [] ^ 

, r ,n t 



{+oo 
In 
j=i-ra*l 
+oo 



1 



OA)* 



exp < 



j=t-ra*l 



. _ (l-n) 2 V 
1 07*)* 



(40) 
(41) 
(42) 



Notice that this Laplace transform is infinite for A > i — [at] , and we therefore place the 
restriction p G (0, 1) to ensure that does not happen for all large enough t. As above, 
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we have a something close to a Riemmann sum between brackets. Indeed, since the map 
u i-> ln(l/(l — (1 — a) 2p p 2 / u 2p )) is monotone decreasing, for all j >t — \at] 



< 



t ln L _ (l-a) 2 V / ^ ( 1 - (^) 2p P 2 ) 
V 1 07*)*" / a— 



ln 



«2p , 



< 



1 



ln 



1 



t i _ (i-«) 2p P 2 
> 1 07*) 2p 



tli (i- a ) 2 V 

V 1 w+mF J 



(43) 



Summing over j, we conclude that 
1 +^ / 1 

3=t-\at\ V 1 (j/t)*P 



•+oo 

(1-a) / In 



Mat] I 1 _P~ 



< - In 



, _ (l-a) 2 V 
■ ((t-|a|)/t)* 



(44) 



This implies that for each 77 G (0, 1) there exist > and T^ 2 - 1 G N such that for all 
t>T^\ if < p < 1-?? 



(^( 2 ) 1 /• 
- -5- < -lnEx[exp(-AA t )] -(1-a) ^ 



ln 



\ 1 — -C- 

(i-«)t \ x S 3 ?. 



cZit 



< 



(2) 



To conclude the section, let 



g t (p,a) = lnEx[exp{-p(l-a)ftP[Z t ]}] 

t- [at] 
= £1* 



(45) 



(46) 



1 



+00 



(l-a)Pp 
1 



j=t-ra*l 



1 



(l-a) 2 V 



,a€ (0,l/2) )P e (0,1). 
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Also define, for the same range of a, p, 



P+OO / 1 



(47) 



From the above, we deduce that for any 5 > 0, if we let Cs tV = + cffi then = 
T 5 {1) +T^ 2) , then 



V«£ (5,1/2), p<= (0,1), t>T 5 , 



9t(p,a) 



-F p (p,a) 



< 



(48) 



4.3 The asymptotic form of the Laplace transform 



We now analyze the function F p (p,a) introduced above, as well as its minimum over p, 
which we will prove to precisely the function in the statement of the Theorem. 



c p (a) = inf F p (p,a) (a G (0,1/2)). 

/06(0,1) 



We have the following formulae for all a 6 (0, 1/2) 

0, 



limFp(p, a) 

p\o 

Um F p (p, a) 
p/ 1 



+oo, 



1 9F„ 



1 — a dp 



(p, a) 



du + I 2P 2P 2 ^, 
+ (0 7i « p - p 



i a 2 K 



1 — a d 2 p 
Notice, then, that 



£ (P,«) 



- f — - — du+ f 

J*vP + p J 1 

1—a 

f 1 1 . f + °° 2u 2 p + 2p 2 

J (yP + pY dU + J 1 (u 2 P - P 2)2 



du. 



(49) 

(50) 
(51) 

(52) 
(53) 



d 2 F 1 
Vet > inf nr , p (p, a) > inf (1 — a) / -77 du 

P e(o,i) d 2 p VP ' ' ~ P e(o,i) V ' (vP + p 2 

f 1 1 

= inf (1 — a) / -. —77 du = a a > 0, (54) 

pe(o,i) V 7 7^ uP + 1 2 

1—a 

hence -F p (-, a) is a strictly convex function of p, for any a G (0, 1/2). Moreover, 

ap /"l 1 

lim — ^(p,a) = -(1-a) / — du < 0, (55) 
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The two last assertions prove that for any fixed a: 

1. F p (-, a) is a strictly convex function of p; 

2. F p (p,a) < 0, dpF p (p,a) < for all small enough p; 

3. F p (-, a) thus has an unique minimum over (0, 1), and this minimum is achieved at the 
unique value p* = p* (a) such that 

8F 

yV(a),«)=0; (56) 

4. by the definition of c p {a) and the above items, 

c p (a) = inf F p (p, a) = F p (p*(a),a) < 0; (57) 
P e(o,i) 

5. by strict convexity (and using the definition of a a > above) and the fact that 
dpF p (p*(a),a) = 0, then there exists a value b = b(a) depending continuously on a 
such that, for all e > small enough, 

F p ((l ± e)p*(a), a) < F p (p*(a), a) + b a e 2 = c p (a) + b a e 2 ; (58) 

6. in fact, we can strengthen the previous item and say that for all 5 G (0, 1/2) there 
exists Bs, es > such that 

Va € (5, 1/2 - S),e G (0, e s ),F p ({l ± e)p*(a), a) < c p (a) + S 5 e 2 ; (59) 

We now prove that c p is a smooth function of a with a positive derivative. To prove 
smoothness, we only need to show that p* is a smooth function of a, since c p (a) is given 
by the formula in (|56|) , But recall that we have shown that p* is uniquely defined by the 
equation 

BF 

G(p*(a),a) = -^(p*(a),a) = 0, (60) 

and we know that 

dG d 2 F 

~dp ^' ^ = ~Wp^ P ' a ^- aa> °' ^ 61 ) 

Hence the Implicit Function Theorem applies |12j . and implies that p* is indeed a smooth 
function of a. 

To prove that c' (a) > 0, we first differentiate F p (p,a) with respect to a. 

^(p,a) = (l-2a)log \ 1 + Y-^y) -^^(ft")- ( 62 ) 
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Now notice that, by the chain rule, 

c' p (a) 

(via [d p F p ](p*,a) = 0) 



d_ 

da 
8F. 



(F p (p*(a),a)) 



dF n 



OF 



p (p*(a),a) 



dp 

(by (El) = (1- 2a) log ^1 + 
(by item 4. above) = (1 — 2a) log 1 + 



P 



1-a 
P 



(-*- 

\l-a 



F p (p,a) 
1 — a 

Cp(a) 
1 — a 



Cp(a) 
1 — a 



(omitting a > term) > — 
(since c p < 0) > 0. 
Finally, we show that 

p*(a) is a non-increasing function of a. 
This is important because it implies that, for all a > 5 > and all small enough e, 

(l + e)p*(a) < (l + e)p*(S) < r) < 1 
for some r\ = rjs depending on 5 only. In conjunction with (|48|). this will imply that 



Vae (S,l/2),pe (0,l),t>Ti,e€ [0, e<5 ] 

± e)p*(a),a) 



F p ((l±e)p*(a),a) 



< 



where C$ = Cs t rj s depends on 5 only. 
To prove (|7()|). we notice that 



(1 - 2a) log (l + ^ty) - j^F p (p, a) 



p=p*(a) 



(1 - 2a) 



(T^)P+P*( Q ) 



1=3 T^(P* (")>«) 



(1 " 2a) ^U) >0 ' 
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where (j73j) follows from 

[d p F p ](p*(a),a) = 0. 

Hence, if (5 > is small enough 



BF BF 

p (p*(a),a + (3)>^(p*(a),a)=0. (74) 



dp dp 

But by the strict convexity of F p (-, a + 0), 

BF BF 

-qZ(j>, a + P)<^-(p*(a + (3), a + (3) 

for all p < p*{a + 0). Hence p*(a + 0) < p*{a) whenever (5 is small enough. This finishes 
the proof. 

4.4 Proof of Theorem El 

We now have all the tools necessary to prove Theorem |2j 

Proof: [of Theorem[2j Let us now apply the upper and lower bounds (fTH)) and (|2*2*|) presented 
above. We will assume that 5 < a < 1/2 — 5 for some constant 5 > 0, and prove bounds 
on Pr[t lQ! ] (ELead) that are uniform on that range of a. 

In the current setting, Zt = At + At and we have defined 



gt(p,a) = In Ex 



e -xz t 



\X=p[(l-a)t]P ! 



hence for all fixed p G (0, 1), a G (5, 1/2 - 5) and t > T 5 (cf. fftty ) 

Pr M (ELead) = Pr (Z t < 0) < exp(^(p,a)). (75) 

In particular, setting p = p*(a) and t, a as above, we can use (|71|) to bound 

InPr [t>a] (ELead) < exp(c p (a)t + C s ). (76) 

The above upper bound can be matched via the lower bound method in (|22|). Let e > 0. 
One can set A = p*(a)[(l — a)t] p , rji = yfet/X and r\2 = \/et/2X in (|2*2^1 to deduce 

Pr M (ELead) = Pr (Z t < 0) 

/ fft ((l+e)p%a)-6 3 / 2 t 9t ((l-e)p*,a)-e 3 /2i/ 2 \ 
> e 9t(p*,a)-^ 1 _ £ £ . (77) 
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Now notice that and any < e < es (cf. ((ST))) and l|71|0. one has that 

g t (p*,a) = c p (a)t±Cs, (78) 
fft ((l + e)/,a) = F p ((l + €)/>*, a) titf* (79) 
(by (EU) and jni) < c p (a)t + B a e 2 t + C s 

£ 3/2 

(for small enough e) < c p (a) t H — — i + Cg, 

g t ((l-e)p*,a) = F p ((l - e)p* , a) t + C s (80) 
(by JSHI) and GB) < c p (a) t + 5 a e 2 t + C.5 

£ 3/2 

r 



(for small enough e) < c p (a) t H — t + C$. 



Substitution back into (f77|) yields 

Pr M (ELead) > (1 - 2e c ^ eilH/i ) exp{c p (a)t - C 5 - ^t}, t > T s , (81) 

for any small enough e and any t > T^. In particular, if we set e = [(Cs + ln4)(4i)]~ 2 / 3 , 
then e \ as i -> +oo, so that for all large enough t the above formulae apply and 

P rM (ELead)> ^fat°»;-^ t2/3 ) , (82, 

for some constant C' & > Cs- Redefining Cs and Ts if necessary, we can then conclude (using 
(JTHJ) and (HU)) that 

Va 6 (5,1/2 - <5),Vt > T 5 , c p (a) t - C s t 2/3 < lnPr M (ELead) < c p (a)t + Cs- (83) 

Since we have already shown that c p is smooth and monotone-increasing in a (cf. Sec- 
tion the Theorem follows. □ 



5 The most likely escape path 

This section is dedicated to Theorem |31 After some preliminaries are considered in Sec- 
tion 15.11 we then estimate (in Section 15.2(1 the transition probabilities of the balls-in-bins 
process conditioned on ELead. Those estimates are used to show in Section 15.31 that 
for short enough times, the conditioned process evolves from a state [t, a] to a state 
[(1 + rj)t, a + g P (a)rj], thus staying close to the tangent of the ODE. The final steps 
of the proof are presented in Section 15.41 and a Lemma used in Section 15.31 is proven in 
Section 15.51 
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5.1 Preliminaries 

According to Theorem |5J the map 



: (0,|) -» R- 

.. in Pr [f , a] (ELead) (84) 
a h-> hm^+oo ^ ^ 



is infinitely differentiable. In particular, this means that, for all 5 G (0, 1/2), the suprema 

(r£NU{0}) (85) 



D { p = sup 



5<a<\-8 



1 d r C p 

(«) 



r! da 

are all finite. Moreover, c^(a) > on (0, 1/2), and 

4 1} = min c'Ja) > 0. (86) 

5<a<i-5 

Theorem |21 also tells us that, for 5 as above, there exist Tg G N and (7,5 G R + such that for 
all t > T tf and all a G (-5, 1/2 - 5), 

-C^ 2/3 < lnPr M (ELead) - c p {a) t < Cg. (87) 

Therefore, if a, a' G (5, 1/2 — 6) and t, t' > Tg 

e c p ( a >)t'-c p ( a )t~c s (i+ty :i ) < Pr [f>'] (ELead) ^ eCp(a ' )t '_ Cp(a )t_c 5 (i+(0 2/3 ) 

Pr[t, a ] (ELead) 

Moreover, if we also have that a — ol = e, t' — t = rjt for e, rj < 1/4 (say), then 



1 Prr t / q/i (ELead) f 
1 ln Pr M (ELead) " + 6 ^ (a) (1 + ^ 



< 



+ ^07 + ^(1 + ^). (89) 



This last equation will be repeatedly used in what follows. 



5.2 Transitions conditioned on ELead 

Define 

[t, a] i-» [t*,a*] a, a* G (0, 1), t,t* G N, t < t* , (90) 

to be the event that the initial state of the process is [t,a], and that at time t* — t the 
state of the process is [t* , a*]. The goal of this section is to estimate the probability of the 
transition 

Pr M (ka]~[t*,a*] | ELead) (91) 
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for the case when t < t* < (1 + rj)t for some 77 > and t is large, and we assume (for 
simplicity) that at, a*t* are integers. We will require that a ± rj E (5, 1/2 — 6) for some 
< 5 < 1/2, so that for all c G (a — 77, a + 77) the bounds on Pri> c ] (ELead) coming from the 
previous section apply with the same value of 5 (and thus the same C$, T$). Notice that 
we can assume that 

a 

5<a-r,< < a* < a + r] < 1/2-5, (92) 

I + 77 

otherwise the given probability is 0. One has that 

Pr M ([*,<*]->■ [t*,a*] I ELead) (93) 
Pr M (ELead | [t,a] ^ [t*,a*]) Pr M] ([t,a] ^ [t*,a*]) 

Pr M (ELead) 
Pr r;Ct1 (ELead) Pr^ ([t,a] ^ [t*,q*]) 
Pr[ t)Q! ] (ELead) 

where the first line is Bayes' Rule, and the second follows from the Markov Property of the 
balls-in-bins process. Using the bounds in (|92|) and (|89|) . 

Pr [t ., a .] (ELead) = ^ {a)(f + _ a)f ± p (2) + t) _ (g4) 

P r [t,a] (tLead) 

It will be convenient to have the above equation in a slightly different form, 

Pr^] (ELead) = ^ (q)(4 , _ t) _ a( j {a){f _ t)±{D f) +D f) )r ? ^(^(oj^f-^ (95) 
P r [t,o] (tLead) 

As for 

Pr M ([t,a] ^ [t* ,a*]), (96) 
notice that there t* — t = r]t and that a*t* — at, hence there exist 

t* - t 
a*t* — at 

ways of moving from state [t,a] to state [i*,a*]. For each one of those ways, each step in 
which a ball is added to bin 1 has probability 

(at + a)P 



(at + a)P + ((1 - a)t + b)P 



of occurring (for some 0<o, b <t* — t < rjt), whereas steps in which a ball is added to bin 
2 have probability 

((l-a)t + b)P 
(at + a)P + ((1 - a)t + b)P' 
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for a,b as above. There exist absolute constants Rs and rjs only depending on 5 such that, 
if < 7/ < rjs: 

C -R*v aP < ( at + fl ) P <f A» aP 



«p + (1 - a)P ~ (at + a)P + ((1 - a)t + b)P ~ dP + (1 - a)P 
and 

aP + (1 - a)P ~ (erf + a)P + ((1 - a)i + 6)p ~ aP + (1 - a)P ' 
and therefore, any path moving connecting state [i, a] to state [t*,a*] has probability 

(n \ a*t*—at / r> \ t* —t—la* t* — at) 

— , ) fi — r , i , or) 
aP + (1 - a)P J V aP + (1 - a)py v ; 

(any such path must have a*t* — at steps bin 1 receives a ball, out of a total of t* — t steps) . 
Letting 

a p 

Hy ' aP + (l-a)P 

we conclude that 
Pr [tM ([t,a]^[f,a*]) = e ±R ^ 2t 

x ( a ^:^)pH Q * t *- Qt (l-Ka))^- t )-^ t *-^, (98) 

and that (cf. (1351) 1 

Pr [tia] ([t,a]^[t*,a*}\ ELead) 

= exp{c p (a)(f - i) - a C ;(a)(t* - t) ± (ifc + L>J 2) + dJ 15 ) r/ 2 i} 

JlrjJ) \p(a)^ a Y tt ~ at (1 - K«)) (r "* )-(an " at) • (99) 
for all f > Ts, < 77 < r/s, t < t* < (1 + and a* as above. 

5.3 The most likely transitions 

We continue with the same setup as above, and state a useful lemma that we prove in 
Section 15.51 

Lemma 1 Let a > 0, < p < 1 and an integer m G N, m > 2 be given. Define, for 
n G [771] U {0}, 

6(n) = r^p ft a n (l -p) m - n 

T/ten 
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1. the sequence {b{n) | < n < m} is unimodal; 



2. maxo< n < m b(n) is achieved at no 



pam+(l— p) 
pa+{l-p) 



3. for all K>0, E\n-n \>K^K™) < mb(n )e W=^> . 
To apply this lemma, we give new names to familiar quantities. 



m = m(t* ,t) 


= t* - t < 7]t 




(100) 


n = n(t, t*a, a*) 


= a*t* — at 




(101) 


a = a{ot) 


— e c' p (a) 




(102) 


P 


= P(a) 




(103) 


n = n (a,t*,t) 


pam + (1 
(pa + 1 - 


-P)] 
P) ' 


(104) 



With those definitions, 



no < pam + (1 - p) 1_ < J 
m ~ (pa + 1 — p)m m ~ 



(MM) 

(pa + 1 — p)m 



For a G (5.1/2 — 6) , a < e d s ' . Moreover, p(a) is a continuous function of a that is between 
p(5) > and 1/2 for a as above. Thus there exists a constant U = Us £ (0, 1) such that 



n 



m 



<1-U S 



and therefore the Lemma implies that, for K > 

-..ill 

y b(n) < mb(no)e 2U s . 

\n—n \>KyJm 

Now notice that, in the present case, Q99|) implies that 

Pr M ([*, a] ■"►[**,<*] | ELead) 



(105) 



b(n) 



exp{c p (a)(t* - t) - ad p (a)(t* - t) ± (i? 5 + + L>J X) ) t? 2 *} 



(106) 



It follows that for all K > 0, a e (5, 1/2 - 5), t > T s , < n < qg, t < t* < (1 + 77)* and a* 
as above 



Prrjj ([t,a] 1 ^ [i*,a] with |n(£, i*, a, a*) — no (a, t* ,t)| > K^fW^t I ELead) 

< (f - t) exp | 2 (^ + flf) + Df) rft-^-V 



(107) 
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To use this formula, we will assume that t* — t = nt (i.e. equality instead of the above 
inequality), and then set K = ^/rjt. Then 



Pr [tja ] ([t,a] i ^ [t*,a] with \n(t, t*, a, a*) - n (a,t*,i)\ > n 3/2 t | ELead) 

(108) 



and (by making 775 smaller if necessary) we can ensure that there exists V5 > such that, 
with < 77 < 775, 



Pr 



>a ] 1 ^ [t*,a] with |n(i, t*, a, a*) - n (a,t*,t)\ > rf /2 t | ELeacf) 



[t 

<e- y ^ 1 . (109) 

To conclude this part, we look at how a* behaves when 

\n(t,t*,a,a*) - n (a,f,t)\ < r] 3/2 t. 



In that case, 



As defined above, 



ttt + n a In 

a = = 1 . 

t* I + 77 1 + 77 t 



no _ 1 
t ~ t 



pant + (1 — p) 
{pa+1- p) 



pan 1 - p 



pa + 1 — p (pa + 1 — p)t 



Hence, by making 77,5 even smaller if necessary, we can guarantee that 

a* = a(l - 77) + P 7 ± ( 2r/ 2 + - 1 ~ P - + V 3 A (110) 
v /; pa + l-p V {pa+l~p)t ) 

Hence, recalling that 

. pa a p e c 'p^ 

g c a = — a H = —a H r-^ , 111 

M 7 pa + l-p aPe4(«) + (l- a )p' V 7 

and noticing that there exists a constant Q<5 such that (for the above range of a, 77, etc) 

l - p K Qs 



(pa + l — p)t t 
we deduce the following bound: 



/ a* - a \ 

Pr M l[t, a] ^ [(l + r/)t, a*] : — — = 5p (a) ± (2 V + ^ + | ELeadJ 

^l-e - ^"*. (112) 
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This holds for all 5 < a < 1/2 — 5, < rj < r/s and t>Tg. Notice also that we had assumed 
that ai G N, but this restriction is unnecessary if we make Q$ larger. To summarize our 
conclusions, we state them in slightly modified form as the following Lemma. 

Lemma 2 For each 5 E (5, 1/2 — 5), there exist constants r/s S R + , Vs,Qs £ R + and 
Ts G N such that for all integers t* > t >T$ such that < 77 = ^ — 1 < rjg, 

Pr M ([t, a] ^ [(1 + v )t, a*] : = 9p (a) ±Q r) L/fj+ | ELead^) 

> i_ e -^ (*'-*). (113) 

Notice that the constants appearing in the Lemma might be slightly different than those 
appearing before it, but this is nothing but a slight abuse of notation. 

Remark 1 Let us now show why g p (a) > always, as stated in the introduction to this 
chapter. Choose 5 < a* < a < 1/2 — 5 be fixed, but also close enough to a. Then 
cpipt) > c p (a*) always (by Theorem HJ), and one can easily deduce from the reasoning 
proving 189\) and Lemma\^ that for all ij > fixed (but small enough) 

Pr M ([t,a] 1 * [(l + n)t,a*] \ ELead) < e~ v ^. 

There are r\t choices for the number of balls in the first bin at time rjt, and one can easily 
deduce via an union bound that 



Pr[ t Q ] ([t, a] 1 ^ [(1 + i])t, a*] for some a* < a \ ELead) < e 



-n(t) 



(Notice that, as shown above, only a* > a/(l + if) > a — n need to be considered, so 
one can pick r] so that a* > 5 for all relevant a*.) On the other hand, we know from 
Lemma\^that with very high probability [t, a] evolves into a state [(l + n)t, a*] with a* — a = 
g(a)r) + O (ry 3 / 2 + 1/t) . If g(a) < 0, we could pick a small enough rj and a large enough t 
such that a* < a with overwhelming probability, but this was shown to be impossible above. 

5.4 Proof of Theorem El 

We now have what we need to prove Theorem El 

Proof: [of Theorem |3] The idea of the proof is to iterate uses of Lemma [^J which shows 
that, typically speaking, for any small n, the fraction a* of balls after rjt time steps in bin 
1 stays close to the straight line passing through a with slope g p (a). Since the solution 
A(-) = A P)0l {-) of the ODE also stays close to those straight lines (at least locally), this 
technique will give us the desired result. Let a £ (0, 1/2) and < K < T PjCr be as in the 
statement of the theorem. Choose some L E (K,T p ^ a ) and let 5 be such that 

5<minja-e, - - A(L) - e\ (114) 
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for some < eminja, 1/2— A(L)} (as discussed in the introduction, A(s) < 1/2 for s < T Pt0l , 
so such an e exists). 

Now recall the notation from Lemma [2 and assume (as we might) that t satisfies 

r £ i/3-i 

t > T s ,7] = rit = L -^ 1 < mm{ m , L - K} (115) 

Clearly, all that (|115|) requires is that t is large enough. Assuming it holds, there exists an 
integer iVj G N such that 

K < r) t N t < L (116) 

and we will assume, without loss of generality, that in fact rj t N t = K < L. We also define, 
for convenience 



S<z<l/2-S r - 



dz 



(r £ N), (117) 



which is a finite quantity since g is infinitely differentiable on (0, 1/2). 

Recall that we are starting the balls-in-bins process from state [t,a]. We will first look 
at the differences 

Aj = \a(jri) - A(jrj)\, j G [N t ] U {0}, (118) 
and show that these differences remain small with high probability. At j = 0, 

a( 0) = M = A(0) ± i 

so the differences are small at the start. Now assume that, for some j £ [Nt] U {0}, after 
conditioning on ELead, 

With probability > 1 - P j; Vi G [j] U {0}, A, < 7i < e. (119) 

where ^ 

< - = 70 < 7i < • • • < 7j < e 

and Pj G R + . We will show that (again conditioning on ELead) 

With probability > 1 - Pj - e Vs i+k, (120) 
Vi G [j + 1] U {0}, Ai < j h where 

7 J+1 -7 J + (4 1) +GiV + Q,(. 3/2 + ^). 

(Here, W$ > is a constant depending only on 5). To prove this, let us condition on a 
value of oit{r]j) that is compatible with the event described in (|119|) . This means that 

ativj) = oij with \A(rjj) - aj\ < jj. (121) 
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In this case, since jj < e and A is increasing, 

5 < a - e = A(0) -e< aj < A(L) + e < i - 5. (122) 

Using the Markov Property of the balls-in-bins process shows that 

Pr M (A j+ i < 7 i+1 | ELead,a t (r?i) = ay) = Pr [(1+ ^ )4iQj] (A j+1 < -y j+1 | ELead) (123) 

since &t(s) is the number of balls in bin 1 at time \st] (i.e. when there are t+ \st] balls in 
the system) and in the present case rjt € N. To evaluate the latter probability, notice first 
that 

\A((j + 1)t?) - A(j V ) - V A'(j V )\ (124) 
= \A((j + l)v)-A(jTi)-n9p(A{jTj))\ (125) 
< Gfrf- (126) 

Moreover, by (|121jl and the choice of t > T$ one can apply Lemma |2] with (1 + jrj)t 
replacing t and 77/(1 + j'77) replacing 77 to deduce that, conditioned on &t(jv) = a j as above, 
the probability that 

\a t ((j + 1)77) - at(jri) - r,g p (a t (jri))\ < Q 5 (V /2 + ^f 1 ) (127) 

is at least 1 — e~ Vpfi/Q+jv) , When the two previous equations hold, 

I a* (so + J?) - A( s o + v)\ < l«t( s ) - ^( s o)| + v\9p(&t(so)) ~ 9 P (M s o))\ 

+ G?V + (t? 3 / 2 + 1) < 7i + (G« + G?> )77 2 + Q 5 (, 3 / 2 + I±*) . (128) 

Thus, for any aj compatible with Aj < jj, one has that 

Pr M (A i+1 < 7i+1 I ELead,a t (77j) = aj ) > 1 - e -W/(i+K), (129) 

from which (|12(J() immediately follows. 
Now notice that if 

,(1) , ^(2)^2 , n _ , 1 + K , 1 



7.V - iV|(G^ + + + ^1 + I j I < e 

then one can use Q119|) and (|12U|) repeatedly to deduce that 

Pr M (Vj e [N] U {0},Aj < lN | ELead) > 1 - jVe~ W(1+X) . (130) 
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But notice that N = K/r/, so a simple calculation shows that 

for all large enough t,~/N < Wgt" 1 ^ 3 

with Ws G R + depending only on 5. Hence 7tv < e for all large enough t, and for such t 
(|13U|) holds. Finally, one can easily show that in the event described by (|13Uj) . 



Vj G [AT - 1] U {0},Vs G (0, rj),\a t (jTi + s) - A(s)\ <2r ] + lN = (^ 2 / 3 ) ,t » 1. (131) 
Hence, (|130jl actually implies that for all large enough t, 

Pr [M I sup \a t (s) - A{s)\ < Ws^ 1 ' 3 | ELead ) > 1 - O (t 2 ' 3 ] e ~v s t^ /(i+K) ^ (132) 
' \s6[0,^] J v ' 

for a possibly larger Ws- Since 5 is ultimately defined in terms of a and K, (|132j) implies 
the Theorem. □ 



5.5 Proof of Lemma [T] 

To conclude the chapter, we prove Lemma ^ 
Proof: [of Lemma ^ Notice first that, if < n < m 

b(n) _n+ll-p (133) 



6(n + 1) m — n ap 

Notice that x \— > (x + 1/m) /(1 — x) = (x + 1/m) 5Z/>i x ^ * s an increasing function of x that 
is equal to 1/m < 1 at x = and goes to +oo as x — > 1. Hence, if 

pa L _ 1-p 

1-p m __ P a m 



X Q ~ , pa 



1 + ^ P a + (l-p) 



-p 

then 

X > Xq 

x = xo (134) 

X < Xq 



Vx G [0, 1) 



X- 


177 


1-p 


> 


1 


-£ 


ap 


X- 


1-p 




1 


-ar 


ap 




X- 


1-p 


< 


1 


— cc 


ap 



As a result, if we let no = |~xom] (which is the same definition as in the statement of the 
lemma), we have that (using l|133[) ) 

V0<j<m-n 77 — -rr- = II — ^ — r~ > 1 

fr(n +j) ± i b( n o+i) 



27 



and similarly 



VO<i<n ^M^ = n Kno-^ + l) <1 
b{n -J) fJl b(n -i) 



This proves the first two items in the lemma. As for the last one, it suffices to show 
that for all j > K^frn 

b(n +j),b(n -j) < b(n )e~^^ 

We only prove the first inequality; the proof of the second is almost identical. As before, 
we write (using (|133|) ) 



b{n +j) = b(n ) I I - — — — - = b(n ) I I — — - jz r 

Now notice that (using the definition of xq and no) 



j— i- m — (no + 1) \ /m-iio\r 



-p-r )/t — yii-Q -r / in — ftp \ tt ^ t 

vi=i n o + « + 1 y V n o + 1 / V m - n 

/ m - n V / i 

^ brxr exp "Z. 



no + 1 / \ ^ m-n 

i-s Y , i(i + i) v 



x + l/mj 2(1 — x)m 

where x = n§jm is bigger than a;o- As a result of (|134|) 

1 — x ap 

; < 1 

x + 1/m 1 — p 

and putting this together with the previous inequalities 



b{n Q +j) ( pa V( \-x\ j(j + 1) i(j' + l) 



b(no) \(1 — p) J \x + l/mj 2(1 — x)m 2(1 — x)m' 

which, together with the fact that j > K^Jm, finishes the proof. □ 
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6 Open problems 



The results proven here only apply to feedback functions fix) = x p . However, we have been 
able to prove other results for more general functions; see [Sl llOlITT] for several examples. It 
would be interesting to see extensions of the present work to those other feedback functions 
as well. 

Another open problem is to determine the asymptotic behaviour of the ordinary differ- 
ential equation in Theorem especially whether the solution blows up in finite time (i.e. 
Tp. a < +00). We conjecture that this is not the case, but a proof would require a careful 
analysis of the ODE. 
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